Goto

Collaborating Authors

 rule weight




Policy Gradient Reinforcement Learning for Policy Represented by Fuzzy Rules: Application to Simulations of Speed Control of an Automobile

arXiv.org Artificial Intelligence

A method of a fusion of fuzzy inference and policy gradient reinforcement learning has been proposed that directly learns, as maximizes the expected value of the reward per episode, parameters in a policy function represented by fuzzy rules with weights. A study has applied this method to a task of speed control of an automobile and has obtained correct policies, some of which control speed of the automobile appropriately but many others generate inappropriate vibration of speed. In general, the policy is not desirable that causes sudden time change or vibration in the output value, and there would be many cases where the policy giving smooth time change in the output value is desirable. In this paper, we propose a fusion method using the objective function, that introduces defuzzification with the center of gravity model weighted stochastically and a constraint term for smoothness of time change, as an improvement measure in order to suppress sudden change of the output value of the fuzzy controller. Then we show the learning rule in the fusion, and also consider the effect by reward functions on the fluctuation of the output value. As experimental results of an application of our method on speed control of an automobile, it was confirmed that the proposed method has the effect of suppressing the undesirable fluctuation in time-series of the output value. Moreover, it was also showed that the difference between reward functions might adversely affect the results of learning.


Optimization of Fuzzy Controller of a Wind Power Plant Based on the Swarm Intelligence

arXiv.org Artificial Intelligence

The article considers the problem of the optimal control of a wind power plant based on fuzzy control and automation of generating the fuzzy rule base. Fuzzy rules by experts do not always provide a maximum power output of the wind plant and fuzzy rule bases require an adjustment in the case of changing the parameters of the wind power plant or the environment. This research proposes the method for optimizing the fuzzy rules base compiled by various experts. The method is based on balancing weights of fuzzy rules into the base by the Particle Swarm Optimization algorithm. The experiment has shown that the proposed method allows forming the fuzzy rule base as an exemplary optimal base from a non-optimized set of fuzzy rules. The optimal fuzzy rule base has been taken under consideration for the concrete control loop of wind power plant and the concrete fuzzy model of the wind.


The Automatic Training of Rule Bases that Use Numerical Uncertainty Representations

arXiv.org Artificial Intelligence

The use of numerical uncertainty representations allows better modeling of some aspects of human evidential reasoning. It also makes knowledge acquisition and system development, test, and modification more difficult. We propose that where possible, the assignment and/or refinement of rule weights should be performed automatically. We present one approach to performing this training - numerical optimization - and report on the results of some preliminary tests in training rule bases. We also show that truth maintenance can be used to make training more efficient and ask some epistemological questions raised by training rule weights.